This paper considers an optimal regulator problem which is different
from the conventional linear quadratic cost problem but leads to a stable
linear feedback control. The problem considers a double integral quadratic
cost function and an integral constraint on state trajectories. The optimal
open-loop control is transformed to a closed-loop control and subsequently
a modified control is obtained based on a receding horizon notion. This
modified control law is shown to be asymptotically stable and to result in
a new method for stabilizing linear time-varying systems, in addition to the
methods of [1] and [2], as well as providing an easy means to stabilize
time-invariant systems comparable to the method of [3]. Moreover, the gain
matrix for the modified control is obtained from a Riccati-type equation over
a finite time interval, and a large class of nonlinearities can be allowed in
the closed-loop without destroying its stability.
I. Introduction


Consider a linear time-varying system

$$\dot{x}(t) = A(t)x(t) + B(t)u(t) \tag{1.1}$$

$$y(t) = C(t)x(t) \tag{1.2}$$

where $x(t_0) = x_0$, and $(A(t), B(t), C(t))$ are $n \times n$, $n \times m$ and
$p \times n$ piecewise continuous matrices. There have been many studies on
obtaining stable linear feedback controls with some optimal basis. The most
familiar of these is obtained from the steady state optimal control for the
quadratic cost

$$J(u) = \int_{t_0}^{t_f} [y'(t)Q(t)y(t) + u'(t)R(t)u(t)]\,dt \tag{1.3}$$

with a free terminal condition, where $Q(t)$ and $R(t)$ are piecewise continuous
weighting matrices with $Q(t) \ge 0$ and $R(t) > 0$ [1]. Recently in [2] we
suggested another stable linear feedback control which is based on a receding
horizon notion applied to the above quadratic cost problem with a fixed
terminal constraint

$$x(t_f) = 0. \tag{1.4}$$

This is equivalent to the following integral constraint on the control

$$\int_{t_0}^{t_f} \Phi(t_0,t)B(t)u(t)\,dt = -x(t_0), \tag{1.5}$$

where $\Phi(t,\tau)$ is the state transition matrix for (1.1). We have shown in [2]
that this control law, which is obtained from the above problem by a receding
horizon notion ($t_f = t + T$), stabilizes the linear system (1.1) under certain
conditions, and that the gain matrix is obtained from a Riccati differential
equation defined over a finite time interval of arbitrarily short duration. Such
a receding horizon notion has also been discussed by Thomas for time invariant
systems and with $Q = 0$ [10].


In addition to stability, the well-known free terminal steady state optimal
control of [1,4] possesses the important property that it tolerates certain
nonlinearities in the feedback loop without destroying stability. Specifically,
the resulting system has been shown in [5,9] to possess infinite gain margin
and a 60° phase margin under certain conditions which include $Q(t) \ge \alpha I$ for
some $\alpha > 0$. The modified receding horizon control in [2] can be shown to possess
the same property. These kinds of relationships between system coefficients
and the tolerance of nonlinearities in the feedback loop have been extensively
studied, particularly the absolute stability problem for time invariant systems
[7,8], commencing with Lure.

The purpose of this paper is to introduce an optimal regulator problem
which is different from the conventional linear quadratic cost problem but whose
modification leads to another stable linear feedback control with tolerance
for a large class of nonlinearities in the feedback loop. In Section II a double
integral control energy problem with a compatible constraint is formulated and
its optimal open-loop control solution is transformed to a closed-loop linear
state feedback control. In Section III it is shown that the modified control
law obtained by the receding horizon notion is a uniformly asymptotically stable
control which allows for a large class of nonlinearities in the feedback loop.
The corresponding results for linear time invariant systems are stated in
Section IV.

II. A Double Integral Minimum Control Energy Problem


For the given system (1.1)-(1.2), we consider a double integral control energy
function

$$J(u) = \int_{t_0}^{t_f}\int_{t_0}^{\tau} u'(s)R(s)u(s)\,ds\,d\tau \tag{2.1}$$

and a constraint which is similar to (1.5) but is compatible with the cost
function (2.1):

$$\int_{t_0}^{t_f}\int_{t_0}^{\tau} \Phi(t_0,s)B(s)u(s)\,ds\,d\tau = -x(t_0). \tag{2.2}$$

From the variation of constants formula, the constraint (2.2) on the control
is equivalent to the following integral constraint on the state:

$$x(t_0) + \int_{t_0}^{t_f} \Phi(t_0,s)x(s)\,ds = (t_f - t_0)\,x(t_0). \tag{2.3}$$
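The equivalence of (2.2) and (2.3) can be checked in three lines. The following is a sketch of that step, filled in here for the reader and using only the variation of constants formula for (1.1); it is not reproduced from the original text.

```latex
\begin{aligned}
x(\tau) &= \Phi(\tau,t_0)\,x(t_0) + \int_{t_0}^{\tau}\Phi(\tau,s)B(s)u(s)\,ds, \\
\Phi(t_0,\tau)\,x(\tau) &= x(t_0) + \int_{t_0}^{\tau}\Phi(t_0,s)B(s)u(s)\,ds, \\
\int_{t_0}^{t_f}\Phi(t_0,\tau)\,x(\tau)\,d\tau
  &= (t_f-t_0)\,x(t_0) + \int_{t_0}^{t_f}\!\!\int_{t_0}^{\tau}\Phi(t_0,s)B(s)u(s)\,ds\,d\tau .
\end{aligned}
```

The last double integral equals $-x(t_0)$ exactly when (2.2) holds, which is the statement (2.3).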


The solution to the above problem is given as follows.

Theorem 2.1. The optimal control law for the system (1.1) which minimizes the
cost function (2.1) subject to the constraint (2.2) (or equivalently (2.3)) is
given by

$$u(t) = -R^{-1}(t)B'(t)\Phi'(t_0,t)\hat{P}^{-1}(t_0,t_f)\,x(t_0) \tag{2.4}$$

where

$$\hat{P}(t_0,t_f) = \int_{t_0}^{t_f}\int_{t_0}^{\tau} \Phi(t_0,s)B(s)R^{-1}(s)B'(s)\Phi'(t_0,s)\,ds\,d\tau. \tag{2.5}$$

Proof: The proof is straightforward by introducing a Lagrange multiplier $\lambda$ for
the constraint (2.2). The necessary condition for minimizing the cost function

$$J_\lambda(u) = \int_{t_0}^{t_f}\int_{t_0}^{\tau} \Big\{\tfrac{1}{2}u'(s)R(s)u(s) + \lambda'\Phi(t_0,s)B(s)u(s)\Big\}\,ds\,d\tau$$

is that $\partial J_\lambda / \partial u = 0$, i.e.,

$$R(s)u(s) + B'(s)\Phi'(t_0,s)\lambda = 0.$$

o 


Since the controls in (2.1) are more heavily weighted around the initial time
than the terminal time, it is believed that the control magnitude of (2.4) tends
to be smaller than that of a conventional optimal control solution near the
initial time. It is noted that the optimal open-loop control (2.4) does not
guarantee the terminal constraint $x(t_f) = 0$ because the constraint (2.3) does not
necessarily imply (1.4).
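One way to see the weighting claim is to exchange the order of integration in (2.1). The identity below is offered as a sketch of that observation and is not part of the original derivation.

```latex
\begin{aligned}
J(u) &= \int_{t_0}^{t_f}\!\!\int_{t_0}^{\tau} u'(s)R(s)u(s)\,ds\,d\tau
      = \int_{t_0}^{t_f}\Big(\int_{s}^{t_f} d\tau\Big)\,u'(s)R(s)u(s)\,ds \\
     &= \int_{t_0}^{t_f} (t_f - s)\,u'(s)R(s)u(s)\,ds .
\end{aligned}
```

The double integral is therefore a single integral quadratic cost with the time-varying weight $(t_f - s)R(s)$, which is largest at $s = t_0$ and vanishes at $s = t_f$. The same exchange applied to (2.2) gives $\int_{t_0}^{t_f}(t_f - s)\Phi(t_0,s)B(s)u(s)\,ds = -x(t_0)$.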

A closed-loop control can be defined from the open-loop control (2.4) by
replacing $t_0$ with $t$ as follows:

$$u(t) = -R^{-1}(t)B'(t)\hat{P}^{-1}(t,t_f)\,x(t) \tag{2.8}$$

where $\hat{P}(t,t_f)$ is given by either (2.5) or (2.6) with $t_0$ replaced by $t$. It is
noted that the closed-loop control (2.8) tries to satisfy the sliding constraint

$$x(t) + \int_{t}^{t_f} \Phi(t,s)x(s)\,ds = (t_f - t)\,x(t) \tag{2.9}$$

at each instant $t$. From this constraint it is readily seen that the closed-loop
control (2.8) guarantees the terminal constraint $x(t_f) = 0$, in contrast with
the open-loop control (2.4), as can be seen by letting $t$ approach $t_f$ in (2.9).

It is the control (2.8) that will be modified by the receding horizon notion
in order to obtain a stable linear feedback control which satisfies the properties
mentioned in the Abstract.


III. A Modified Regulator for Time-Varying Systems

In addition to the infinite-time optimal control of [1,4] and the modified
receding horizon control of [2], we introduce in this section another stable
linear feedback control law from (2.8) which requires the integration of a
Riccati differential equation over a finite time interval and allows a large
class of nonlinearities in the feedback loop without destroying stability.

The following definitions are necessary for further analyses.

Definition: The pair $\{A(t),B(t)\}$ is uniformly completely controllable if there
exists a positive constant $\delta_c > 0$ such that the following two conditions are
satisfied:

$$\alpha_1 I \le W(t,t+\delta_c) \le \alpha_2 I, \quad \text{for all } t \tag{3.1}$$

$$\|\Phi(t,\tau)\| \le \alpha_3(|t-\tau|), \quad \text{for all } t,\tau \tag{3.2}$$

where the controllability matrix $W(t_0,t_1)$ is defined by

$$W(t_0,t_1) = \int_{t_0}^{t_1} \Phi(t_0,t)B(t)B'(t)\Phi'(t_0,t)\,dt \tag{3.3}$$

and $\Phi(t,\tau)$ is the state transition matrix for (1.1). Inequalities (3.1) and
(3.2) are presumed to hold for some positive constants $\alpha_1$ and $\alpha_2$, and for an
appropriate function $\alpha_3(\cdot)$ which maps $R$ into $R$ and is bounded on bounded
intervals.

Uniform complete observability of the pair $\{A(t),C(t)\}$ is defined similarly
with the observability matrix

$$M(t_0,t_1) = \int_{t_0}^{t_1} \Phi'(t,t_0)C'(t)C(t)\Phi(t,t_0)\,dt \tag{3.4}$$

corresponding to an appropriate $\delta_o > 0$. Let $\delta = \max\{\delta_c, \delta_o\}$. Now consider a
matrix Riccati equation

$$-\frac{\partial}{\partial s}P(s,\tau) = -A(s)P(s,\tau) - P(s,\tau)A'(s) - P(s,\tau)C'(s)Q(s)C(s)P(s,\tau) + B(s)R^{-1}(s)B'(s) \tag{3.5}$$

with the boundary condition $P(\tau,\tau) = 0$. The Riccati equation (3.5) comes from
the fixed terminal quadratic cost problem (1.1)-(1.4). Note that Equation (2.7)
can be obtained from (3.5) by taking $Q(s) = 0$. The following results are necessary
for the main theorem.

Lemma 3.1. Assume that $\alpha_4 I \le R(t) \le \alpha_5 I$ where $\alpha_4$ and $\alpha_5$ are positive constants.

(a) Assume that $0 \le Q(t) \le \alpha_6 I$ for some $\alpha_6 > 0$. If the pair $\{A(t),B(t)\}$ is
uniformly completely controllable and $C(t)$ is bounded, i.e., $\|C(t)\| \le \alpha_7$ for
some $\alpha_7 > 0$, then for a fixed $T$ satisfying $\delta_c < T < \infty$ there exist positive
constants $\alpha_8$ and $\alpha_9$ such that $\hat{P}(t,t+T)$ obtained from (2.6) and (3.5) satisfies

$$\alpha_8 I \le \hat{P}(t,t+T) \le \alpha_9 I \quad \text{for all } t. \tag{3.6}$$

(b) Assume that $\alpha_{10} I \le Q(t) \le \alpha_6 I$ for some $\alpha_{10} > 0$. If the pairs $\{A(t),B(t)\}$
and $\{A(t),C(t)\}$ are uniformly completely controllable and observable respectively,
then for a fixed $T$ satisfying $\delta < T < \infty$ the relation (3.6) holds.

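Proof: (a) It is shown in [2] that there exist positive constants $\alpha_{11}$ and $\alpha_{12}$
such that

$$\alpha_{11} I \le P(t,t+T) \le \alpha_{12} I \quad \text{for all } t. \tag{3.7}$$

The upper bound of (3.6) follows from the fact that

$$\hat{P}(t,t+T) = \int_{t}^{t+T} P(t,\tau)\,d\tau \le \int_{t}^{t+T} P(t,t+T)\,d\tau \le T\,P(t,t+T) \le T\alpha_{12} I$$

since $P(t,\tau_1) \le P(t,\tau_2)$ for $\tau_1 \le \tau_2$. The lower bound follows from

$$\hat{P}(t,t+T) = \int_{t}^{t+\delta_c} P(t,\tau)\,d\tau + \int_{t+\delta_c}^{t+T} P(t,\tau)\,d\tau \ge \int_{t}^{t+\delta_c} P(t,\tau)\,d\tau + \int_{t+\delta_c}^{t+T} P(t,t+\delta_c)\,d\tau \ge (T - \delta_c)\,\alpha_{11} I.$$

(b) The dual form of inequality (3.7) can be found in [1] and [6]. Inequality
(3.6) follows from the above argument. This completes the proof.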

By sliding the terminal time of (2.8), i.e. invoking the receding horizon
notion, we now introduce a modified control law:

$$u(t) = -R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)\,x(t), \quad T > \delta_c \tag{3.8}$$

where $\hat{P}(t,t+T) = \int_{t}^{t+T} P(t,\tau)\,d\tau$ and $P(t,\tau)$ is obtained from (2.7) by integrating
backward from $s = \tau$ to $s = t$. It is easily seen that (3.8) is the optimal control
for the system (1.1) which minimizes the moving cost function

$$\int_{t}^{t+T}\int_{t}^{\tau} u'(s)R(s)u(s)\,ds\,d\tau \tag{3.9}$$

with the moving boundary condition on the control

$$\int_{t}^{t+T}\int_{t}^{\tau} \Phi(t,s)B(s)u(s)\,ds\,d\tau = -x(t) \tag{3.10}$$

or equivalently with the moving boundary condition on the state

$$x(t) + \int_{t}^{t+T} \Phi(t,s)x(s)\,ds = T\,x(t). \tag{3.11}$$

Important properties of the control law (3.8) which will now be demonstrated
are (i) the closed-loop system is asymptotically stable, (ii) the gain matrix
in (3.8) is obtained from integrating a Riccati equation over a finite time
interval, and (iii) the closed-loop system tolerates a large class
of nonlinearities in the feedback loop corresponding to a chosen small value
of $T$. In the following theorem the matrix $\hat{P}(t,t+T)$ in (3.8) is assumed to be
obtained from (3.5) instead of (2.7) in order to facilitate a weighting of the
state through $Q(t)$.


Theorem 3.1. (a) Assume that $\alpha_4 I \le R(t) \le \alpha_5 I$ and $0 \le Q(t) \le \alpha_6 I$. If the
pair $\{A(t),B(t)\}$ is uniformly completely controllable and $C(t)$ is bounded, then
for a fixed $T$ satisfying $\delta_c < T \le 2$, the system (1.1) with the control law (3.8)
is uniformly asymptotically stable where $\hat{P}(t,t+T)$ is obtained from (3.5).
Furthermore, the system (1.1) is uniformly asymptotically stable with either of the
following control laws:

(i)

$$u(t) = -\psi(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)\,x(t), \quad T > \delta_c \tag{3.12}$$

where $T$ and the $m \times m$ time-varying matrix $\psi(t)$ are such that

$$\psi'(t)R(t) + R(t)\psi(t) - T R(t) \ge 0, \tag{3.13}$$

or, (ii)

$$u(t) = \zeta\big(-R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)\,x(t)\big), \quad T > \delta_c \tag{3.14}$$

where $\zeta(\cdot)$ maps $R^m$ into $R^m$ such that

$$w'R(t)\zeta(w) \ge \frac{T}{2}\,w'R(t)w. \tag{3.15}$$

(b) Assume that $\alpha_4 I \le R(t) \le \alpha_5 I$ and $\alpha_{10} I \le Q(t) \le \alpha_6 I$. If the pairs
$\{A(t),B(t)\}$ and $\{A(t),C(t)\}$ are uniformly completely controllable and
observable respectively, then the above results in (a) hold with $\delta_c$ replaced by $\delta$.


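Proof: Take as a Lyapunov function for the closed-loop system (1.1)-(3.8) with
(3.5),

$$V(x,t) = x'\hat{P}^{-1}(t,t+T)\,x. \tag{3.16}$$

From Lemma 3.1(a) the function (3.16) has uniform lower and upper bounds. The
derivative of (3.16) along the solution is given by

$$\begin{aligned}
\dot{V}(x,t) &= \dot{x}'\hat{P}^{-1}(t,t+T)x + x'\hat{P}^{-1}(t,t+T)\dot{x} + x'\Big[\frac{d}{dt}\hat{P}^{-1}(t,t+T)\Big]x \\
&= x'\Big\{\big(A(t)-B(t)\psi(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)\big)'\hat{P}^{-1}(t,t+T) \\
&\qquad + \hat{P}^{-1}(t,t+T)\big(A(t)-B(t)\psi(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)\big) \\
&\qquad - \hat{P}^{-1}(t,t+T)\Big[\frac{d}{dt}\hat{P}(t,t+T)\Big]\hat{P}^{-1}(t,t+T)\Big\}x.
\end{aligned} \tag{3.17}$$

By integrating both sides of (3.5) we obtain

$$\begin{aligned}
-\int_{t}^{t+T}\frac{\partial}{\partial t}P(t,\tau)\,d\tau
&= -A(t)\int_{t}^{t+T}P(t,\tau)\,d\tau - \int_{t}^{t+T}P(t,\tau)\,d\tau\,A'(t) \\
&\qquad - \int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau + \int_{t}^{t+T}B(t)R^{-1}(t)B'(t)\,d\tau \\
&= -A(t)\hat{P}(t,t+T) - \hat{P}(t,t+T)A'(t) \\
&\qquad - \int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau + T\,B(t)R^{-1}(t)B'(t),
\end{aligned} \tag{3.18}$$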
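from which follows that

$$\begin{aligned}
-\hat{P}^{-1}(t,t+T)\Big[\int_{t}^{t+T}\frac{\partial}{\partial t}P(t,\tau)\,d\tau\Big]\hat{P}^{-1}(t,t+T)
&= -\hat{P}^{-1}(t,t+T)A(t) - A'(t)\hat{P}^{-1}(t,t+T) \\
&\qquad + T\,\hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T) \\
&\qquad - \hat{P}^{-1}(t,t+T)\Big[\int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau\Big]\hat{P}^{-1}(t,t+T).
\end{aligned} \tag{3.19}$$

Also we have that

$$\frac{d}{dt}\hat{P}(t,t+T) = \frac{d}{dt}\int_{t}^{t+T}P(t,\tau)\,d\tau = P(t,t+T) - P(t,t) + \int_{t}^{t+T}\frac{\partial}{\partial t}P(t,\tau)\,d\tau. \tag{3.20}$$

Combining (3.17), (3.19) and (3.20) yields

$$\begin{aligned}
\dot{V}(x,t) &= -x'\hat{P}^{-1}(t,t+T)\Big\{P(t,t+T) + \int_{t}^{t+T}\frac{\partial}{\partial t}P(t,\tau)\,d\tau - \int_{t}^{t+T}\frac{\partial}{\partial t}P(t,\tau)\,d\tau \\
&\qquad + \int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau \\
&\qquad + B(t)R^{-1}(t)\big[\psi'(t)R(t) + R(t)\psi(t) - T R(t)\big]R^{-1}(t)B'(t)\Big\}\hat{P}^{-1}(t,t+T)x \\
&\le -x'\hat{P}^{-1}(t,t+T)P(t,t+T)\hat{P}^{-1}(t,t+T)x
\end{aligned}$$

provided condition (3.13) holds. From (3.6) and (3.7) it follows that
$\dot{V}(x,t) \le -\alpha_{11}\alpha_9^{-2}\|x\|^2$, which implies that the closed-loop system is uniformly
asymptotically stable. The proof concerning the control (3.14)-(3.15) is exactly
the same as above. With $\psi(t) = I$, or $\zeta(w) = w$, it is clear that $T$ must be no
greater than 2. The proof of part (b) is also exactly the same as part (a). This
completes the proof.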




It is noted that nonlinearities satisfying $\psi(t) \ge T/2$ are allowed in the
feedback loop for a single input system and that for a given nonlinear function
it is possible to find a suitable value of $T$ in order to obtain a stable system.
It is also noted that the Lyapunov function (3.16) corresponds to the original
system (1.1)-(3.8), while that used for the proof of the control in [2] was
defined for the adjoint system in order to include the special case of $Q(t) = 0$.
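As an illustration of the tolerance result in Theorem 3.1 (this example is not from the original paper), the sketch below simulates a double integrator, an assumed test plant with $A = [[0,1],[0,0]]$, $b = [0,1]'$, $R = 1$ and $Q = 0$. For this plant the double integral in (4.4) evaluates to $\hat{P}(T) = [[T^4/12, -T^3/6], [-T^3/6, T^2/2]]$, so the gain is $b'\hat{P}^{-1}(T) = [12/T^3,\ 6/T^2]$. The loop gain `psi` is a hypothetical time-varying gain chosen only so that $\psi(t) \ge T/2$; the function names are likewise illustrative.

```python
import math

T = 1.0  # receding horizon length (assumed value, must satisfy T <= 2)
# Inverse of P_hat(T) for the double integrator with R = 1, Q = 0.
P_INV = [[36.0 / T**4, 12.0 / T**3], [12.0 / T**3, 6.0 / T**2]]
K = [12.0 / T**3, 6.0 / T**2]  # b' P_hat^{-1}(T)


def psi(t):
    """Hypothetical time-varying loop gain with psi(t) >= T/2 for all t."""
    return T / 2 + 1.5 * (1.0 + math.sin(3.0 * t))


def f(t, x):
    """Closed loop: x1' = x2, x2' = u with u = -psi(t) K x, as in (3.12)."""
    u = -psi(t) * (K[0] * x[0] + K[1] * x[1])
    return [x[1], u]


def V(x):
    """Lyapunov function (3.16): x' P_hat^{-1}(T) x."""
    return (P_INV[0][0] * x[0] ** 2 + 2 * P_INV[0][1] * x[0] * x[1]
            + P_INV[1][1] * x[1] ** 2)


def simulate(x0, t_end=5.0, h=1e-3):
    """Classical RK4 simulation; returns V sampled along the trajectory."""
    x, t, values = list(x0), 0.0, [V(x0)]
    for _ in range(int(round(t_end / h))):
        k1 = f(t, x)
        k2 = f(t + h / 2, [xi + h / 2 * ki for xi, ki in zip(x, k1)])
        k3 = f(t + h / 2, [xi + h / 2 * ki for xi, ki in zip(x, k2)])
        k4 = f(t + h, [xi + h * ki for xi, ki in zip(x, k3)])
        x = [xi + h / 6 * (a + 2 * b + 2 * c + d)
             for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]
        t += h
        values.append(V(x))
    return values
```

Calling `simulate([1.0, -2.0])` returns a sequence that should decrease at every step, which is what the bound on $\dot{V}$ in the proof predicts when (3.13) holds. Choosing a gain that dips below $T/2$ removes that guarantee, although the loop may still happen to be stable.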

It is interesting to investigate some relationships between the modified control
law (3.8) and the open loop control law (2.4) from which the control law (3.8)
is obtained. We show in Theorem 3.2 that the double (resp. single) integral
quadratic cost for the system (1.1) with the control (3.8) is not larger than
the double integral quadratic cost of the open loop control (2.4) with
$t_f = t_0 + T$ if $t_1 - t_0 + T \le 2$ (resp. $T \le 1$).

Theorem 3.2. The single and double integral quadratic costs for the system (1.1)
with the control (3.8) satisfy the following bounds:

$$\int_{t_0}^{t_1} u'(t)R(t)u(t)\,dt \le x_0'\hat{P}^{-1}(t_0,t_0+T)\,x_0 \quad \text{if } T \le 1, \tag{3.21}$$

$$\int_{t_0}^{t_1}\int_{t_0}^{\tau} u'(t)R(t)u(t)\,dt\,d\tau \le x_0'\hat{P}^{-1}(t_0,t_0+T)\,x_0 \quad \text{if } t_1 - t_0 + T \le 2. \tag{3.22}$$


Proof: Let $F(t) = A(t) - B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)$ and let $\Phi_F(t,t_0)$ be the state
transition matrix for $F(t)$. The single integral quadratic cost with the feedback
control (3.8) is given by $x_0'N(t_0,t_1)x_0$ where $N(t,t_1)$ is the solution of
the matrix Riccati equation

$$-\frac{\partial}{\partial t}N(t,t_1) = F'(t)N(t,t_1) + N(t,t_1)F(t) + \hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T) \tag{3.23}$$

with the boundary condition $N(t_1,t_1) = 0$. From (3.20) and (3.18) we have

$$\begin{aligned}
-\frac{d}{dt}\hat{P}^{-1}(t,t+T) &= F'(t)\hat{P}^{-1}(t,t+T) + \hat{P}^{-1}(t,t+T)F(t) \\
&\qquad + (2-T)\,\hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T) \\
&\qquad + \hat{P}^{-1}(t,t+T)\Big[P(t,t+T) + \int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau\Big]\hat{P}^{-1}(t,t+T).
\end{aligned} \tag{3.24}$$

Let $E(t) = N(t,t_1) - \hat{P}^{-1}(t,t+T)$. Then from (3.23) and (3.24) we have

$$\begin{aligned}
-\dot{E}(t) &= F'(t)E(t) + E(t)F(t) - (1-T)\,\hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T) \\
&\qquad - \hat{P}^{-1}(t,t+T)\Big[P(t,t+T) + \int_{t}^{t+T}P(t,\tau)C'(t)Q(t)C(t)P(t,\tau)\,d\tau\Big]\hat{P}^{-1}(t,t+T)
\end{aligned} \tag{3.25}$$

with the boundary condition $E(t_1) = -\hat{P}^{-1}(t_1,t_1+T)$. From (3.25) it follows that

$$N(t_0,t_1) - \hat{P}^{-1}(t_0,t_0+T) = E(t_0) \le -\Phi_F'(t_1,t_0)\hat{P}^{-1}(t_1,t_1+T)\Phi_F(t_1,t_0) \tag{3.26}$$

provided $T \le 1$. In a similar way it can be shown that the double integral cost
with the control (3.8) is given by $x_0'\hat{N}(t_0,t_1)x_0$ where $\hat{N}(t,t_1)$ is the solution
of the matrix Riccati equation

$$-\frac{\partial}{\partial t}\hat{N}(t,t_1) = F'(t)\hat{N}(t,t_1) + \hat{N}(t,t_1)F(t) + (t_1 - t)\,\hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T)$$

with the boundary condition $\hat{N}(t_1,t_1) = 0$. Let $E(t) = \hat{N}(t,t_1) - \hat{P}^{-1}(t,t+T)$.
Then we have

$$-\dot{E}(t) = F'(t)E(t) + E(t)F(t) - (t - t_1 + 2 - T)\,\hat{P}^{-1}(t,t+T)B(t)R^{-1}(t)B'(t)\hat{P}^{-1}(t,t+T) - Z(t),$$

where $Z(t)$ is the nonnegative definite last term of (3.25). Thus we have a similar
inequality as (3.26) provided $T + t_1 - t_0 \le 2$. This completes the proof.

The relation (3.21) shows that the infinite time control energy is bounded
if the system is uniformly completely controllable. The results in this section
have simpler forms in the case of time invariant systems as shown in the next
section.
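The bound (3.21) can be checked numerically on a small example. The sketch below is an illustration added here, not taken from the paper: it assumes a double integrator ($A = [[0,1],[0,0]]$, $b = [0,1]'$, $R = 1$, $Q = 0$) with $T = 1$, for which $\hat{P}^{-1}(T) = [[36/T^4, 12/T^3],[12/T^3, 6/T^2]]$, and accumulates the control energy of the feedback (3.8) along a simulated trajectory. The function names and the finite simulation horizon are assumptions of the example.

```python
T = 1.0  # receding horizon length; (3.21) requires T <= 1
K = [12.0 / T**3, 6.0 / T**2]  # gain b' P_hat^{-1}(T) of (3.8), R = 1
P_INV = [[36.0 / T**4, 12.0 / T**3], [12.0 / T**3, 6.0 / T**2]]


def f(z):
    """Augmented dynamics z = (x1, x2, J) with J' = u' R u and R = 1."""
    u = -(K[0] * z[0] + K[1] * z[1])
    return [z[1], u, u * u]


def control_energy(x0, t_end=10.0, h=1e-3):
    """RK4 integration of the closed loop; returns the accumulated energy."""
    z = [x0[0], x0[1], 0.0]
    for _ in range(int(round(t_end / h))):
        k1 = f(z)
        k2 = f([zi + h / 2 * ki for zi, ki in zip(z, k1)])
        k3 = f([zi + h / 2 * ki for zi, ki in zip(z, k2)])
        k4 = f([zi + h * ki for zi, ki in zip(z, k3)])
        z = [zi + h / 6 * (a + 2 * b + 2 * c + d)
             for zi, a, b, c, d in zip(z, k1, k2, k3, k4)]
    return z[2]


def bound(x0):
    """Right-hand side of (3.21): x0' P_hat^{-1}(T) x0."""
    return (P_INV[0][0] * x0[0] ** 2 + 2 * P_INV[0][1] * x0[0] * x0[1]
            + P_INV[1][1] * x0[1] ** 2)
```

For any initial state, `control_energy(x0)` should not exceed `bound(x0)`. Because the closed loop decays quickly, a simulation horizon of 10 time units is already a close stand-in for the infinite time energy mentioned above.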

IV. Linear Time Invariant Systems

In this section consider a linear time invariant system

$$\dot{x}(t) = Ax(t) + Bu(t), \quad x(t_0) = x_0 \tag{4.1}$$

$$y(t) = Cx(t) \tag{4.2}$$

where $\{A,B,C\}$ are constant matrices. If the pair $\{A,B\}$ is completely controllable,
the minimization of the double integral quadratic control energy (2.1) with a
constant matrix $R$ subject to the constraint (2.2) or (2.3) leads to the optimal
control law

$$u(t) = -R^{-1}B'\hat{P}^{-1}(t_f - t)\,x(t) \tag{4.3}$$

where

$$\hat{P}(t) = \int_{0}^{t}\int_{0}^{\tau} e^{-As}BR^{-1}B'e^{-A's}\,ds\,d\tau. \tag{4.4}$$

The result analogous to Theorem 3.1 for the time invariant case is contained in
the following.

Theorem 4.1. If the pair $\{A,B\}$ is completely controllable, then the system (4.1)
is uniformly asymptotically stable with the feedback control law

$$u(t) = -R^{-1}B'\hat{P}^{-1}(T)\,x(t) \tag{4.5}$$

where $0 < T \le 2$ and $\hat{P}(T)$ is obtained from

$$\hat{P}(T) = \int_{0}^{T} P(t)\,dt \tag{4.6}$$

and $P(t)$ is given by

$$\frac{d}{dt}P(t) = -AP(t) - P(t)A' - P(t)C'QCP(t) + BR^{-1}B', \quad P(0) = 0 \tag{4.7}$$

for any $Q \ge 0$ and $R > 0$. Furthermore, the system (1.1) is uniformly
asymptotically stable with the control law:

$$u(t) = -\psi(t)R^{-1}B'\hat{P}^{-1}(T)\,x(t), \quad T > 0 \tag{4.8}$$

where $T$ and $\psi(t)$ satisfy the relation (3.13) with a constant matrix $R$, or with
the control law

$$u(t) = \zeta\big(-R^{-1}B'\hat{P}^{-1}(T)\,x(t)\big), \quad T > 0 \tag{4.9}$$

where $T$ and $\zeta(\cdot)$ satisfy the relation (3.15).

From the special structure of a time invariant system, the condition of
Theorem 4.1 can be weakened as follows.

Proposition 4.1. If the pair $\{A,B\}$ is stabilizable, then the system (1.1) is
uniformly asymptotically stable with the feedback control

$$u(t) = -R^{-1}B'\hat{P}^{+}(T)\,x(t), \quad 0 < T \le 2 \tag{4.10}$$

where $\hat{P}^{+}(T)$ is the generalized inverse of the matrix $\hat{P}(T)$.

The proof of the above result is similar to the one in [2] and is thus
omitted here. It is noted that for the special case of $Q = 0$, $\hat{P}(t)$ in (4.3)
and (4.5) can be obtained very easily by a single integration as shown below.
Integrating both sides of (4.7) with $Q = 0$ we obtain

$$P(t) = -A\hat{P}(t) - \hat{P}(t)A' + t\,BR^{-1}B'. \tag{4.11}$$

From (4.6) it follows that $P(t) = \frac{d}{dt}\hat{P}(t)$ so that $\hat{P}(t)$ satisfies

$$\frac{d}{dt}\hat{P}(t) = -A\hat{P}(t) - \hat{P}(t)A' + t\,BR^{-1}B', \quad \hat{P}(0) = 0. \tag{4.12}$$
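The single integration in (4.12) is short enough to sketch in code. The example below is a minimal illustration under stated assumptions, not the authors' implementation: the plant is a double integrator ($A = [[0,1],[0,0]]$, $B = [0,1]'$, scalar $R = 1$), the integrator is a fixed-step classical Runge-Kutta scheme, and all function names are made up for the example. Only the Python standard library is used.

```python
# Assumed example plant: double integrator with scalar R = 1.
A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0], [1.0]]
R_INV = 1.0


def matmul(X, Y):
    """Product of two matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def transpose(X):
    return [list(row) for row in zip(*X)]


def add(X, Y, scale=1.0):
    """Return X + scale * Y elementwise."""
    return [[x + scale * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


BRB = [[R_INV * v for v in row] for row in matmul(B, transpose(B))]


def rhs(t, P_hat):
    """Right-hand side of (4.12): -A P_hat - P_hat A' + t B R^{-1} B'."""
    drift = add(matmul(A, P_hat), matmul(P_hat, transpose(A)))
    drift = [[-v for v in row] for row in drift]
    return add(drift, BRB, scale=t)


def integrate_p_hat(T, steps=2000):
    """Classical RK4 from t = 0, where P_hat = 0, up to t = T."""
    h = T / steps
    P = [[0.0, 0.0], [0.0, 0.0]]
    t = 0.0
    for _ in range(steps):
        k1 = rhs(t, P)
        k2 = rhs(t + h / 2, add(P, k1, h / 2))
        k3 = rhs(t + h / 2, add(P, k2, h / 2))
        k4 = rhs(t + h, add(P, k3, h))
        incr = add(add(k1, k4), add(k2, k3), 2.0)  # k1 + k4 + 2 (k2 + k3)
        P = add(P, incr, h / 6)
        t += h
    return P


def gain(P_hat):
    """K = R^{-1} B' P_hat^{-1} for the 2x2 case, so that u = -K x as in (4.5)."""
    (a, b), (c, d) = P_hat
    det = a * d - b * c
    P_inv = [[d / det, -b / det], [-c / det, a / det]]
    return [R_INV * v for v in matmul(transpose(B), P_inv)[0]]
```

For this plant the result can be compared against the closed form obtained from (4.4), $\hat{P}(T) = [[T^4/12, -T^3/6],[-T^3/6, T^2/2]]$, which gives the gain $[12/T^3,\ 6/T^2]$ and the closed-loop characteristic polynomial $s^2 + (6/T^2)s + 12/T^3$.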
The control (4.5) with $\hat{P}(T)$ obtained from (4.12) is as easy to obtain as the one
in [3], but (4.5) has been shown here to allow a larger class of nonlinearities
in the feedback loop.

Some characteristics of the control (4.5) in the frequency domain will now
be considered. For notational convenience we define

$$K \triangleq \hat{P}^{-1}(T), \qquad L \triangleq -R^{-1}B'\hat{P}^{-1}(T),$$

and

$$\hat{Q} \triangleq \hat{P}^{-1}(T)\Big[\int_{0}^{T}P(t)C'QCP(t)\,dt\Big]\hat{P}^{-1}(T) + \hat{P}^{-1}(T)P(T)\hat{P}^{-1}(T). \tag{4.13}$$

Then (4.7) is equivalent to the following matrix equation:

$$A'K + KA - T\,KBR^{-1}B'K + \hat{Q} = 0. \tag{4.14}$$
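The step from (4.7) to (4.14) is compact, so a sketch of it is given here; it is a reconstruction added for the reader, with $K = \hat{P}^{-1}(T)$ denoting the inverse of the integrated matrix in (4.6).

```latex
\begin{aligned}
&\text{Integrate (4.7) over } [0,T] \text{ with } P(0)=0: \\
&\qquad P(T) = -A\hat{P}(T) - \hat{P}(T)A' - \int_0^T P(t)C'QCP(t)\,dt + T\,BR^{-1}B'. \\
&\text{Multiply on the left and on the right by } K = \hat{P}^{-1}(T): \\
&\qquad K P(T) K = -KA - A'K - K\Big[\int_0^T P(t)C'QCP(t)\,dt\Big]K + T\,KBR^{-1}B'K. \\
&\text{Collect the two terms that do not involve } A \text{ or } B: \\
&\qquad A'K + KA - T\,KBR^{-1}B'K
   + \underbrace{K\Big[\int_0^T P(t)C'QCP(t)\,dt\Big]K + K P(T) K}_{\text{the matrix defined in (4.13)}} = 0 .
\end{aligned}
```

The bracketed matrix is nonnegative definite whenever $Q \ge 0$, which is what the frequency domain argument uses.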


By a slight modification of the methods introduced in [5], (4.14) can be shown to
yield the following relation in the frequency domain:

$$\Big[\tfrac{1}{T}I - R^{1/2}L(-sI - A)^{-1}BR^{-1/2}\Big]'\Big[\tfrac{1}{T}I - R^{1/2}L(sI - A)^{-1}BR^{-1/2}\Big] = \tfrac{1}{T^2}I + \tfrac{1}{T}R^{-1/2}B'(-sI - A')^{-1}\hat{Q}(sI - A)^{-1}BR^{-1/2}. \tag{4.15}$$

Relation (4.15) also implies

$$\Big[\tfrac{1}{T}I - R^{1/2}L(-j\omega I - A)^{-1}BR^{-1/2}\Big]'\Big[\tfrac{1}{T}I - R^{1/2}L(j\omega I - A)^{-1}BR^{-1/2}\Big] \ge \tfrac{1}{T^2}I. \tag{4.16}$$

For simplicity we consider a single input system where $R$ can be taken as unity,
$B = b$ is a column vector and $L = \ell$ is a row vector. Relation (4.16) can be
expressed in this case as follows:

$$\Big|\tfrac{1}{T} - \ell(j\omega I - A)^{-1}b\Big| \ge \tfrac{1}{T}. \tag{4.17}$$

Inequality (4.17) implies that the Nyquist plot of the closed-loop transfer
function $-\ell(j\omega I - A)^{-1}b$ lies outside a circle centered at $-\tfrac{1}{T} + j0$ with radius
$\tfrac{1}{T}$ as shown in Fig. 2. The key observation, as noted in [5], is that the number
of encirclements of any point inside this circle is the same. Theorem 4.1 in
conjunction with inequality (4.17) shows that the number of encirclements of
points lying on the real axis inside this circle is the same as the number
of poles of the transfer function in the closed righthand plane. From these
properties and Fig. 2 it is clear that the system employing the control law
(4.5) has an infinite gain margin and the following phase margin:

$$\theta = \cos^{-1}\Big(\frac{T}{2}\Big).$$

Thus, $\theta = 0$ for $T = 2$, $\theta = 60°$ for $T = 1$, and $\theta \to 90°$ as $T \to 0$. From the well
known circle criterion the nonlinear function $\psi(t) \ge T/2$ can be allowed in the
closed-loop without destroying its stability. This is consistent with Theorem
4.1.
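The circle condition and the quoted phase margins can be checked on a concrete case. The sketch below is an added illustration under assumptions: it uses a double integrator ($A = [[0,1],[0,0]]$, $b = [0,1]'$, $R = 1$, $Q = 0$), for which the gain of (4.5) is $[12/T^3,\ 6/T^2]$ and hence $-\ell(sI - A)^{-1}b = 12/(T^3 s^2) + 6/(T^2 s)$. The phase margin function evaluates $\cos^{-1}(T/2)$, the angle at which the unit circle meets the circle of radius $1/T$ centered at $-1/T$; this expression reproduces the three values stated above. Function names are hypothetical.

```python
import math


def loop_gain(omega, T):
    """-l (jwI - A)^{-1} b for the double integrator with the gain of (4.5)."""
    s = 1j * omega
    return 12.0 / (T**3 * s**2) + 6.0 / (T**2 * s)


def outside_circle(T, omegas):
    """Check the circle inequality |1/T + g(jw)| >= 1/T on a frequency grid."""
    return all(abs(1.0 / T + loop_gain(w, T)) >= 1.0 / T for w in omegas)


def phase_margin_deg(T):
    """Angle where the unit circle meets the forbidden circle: arccos(T/2)."""
    return math.degrees(math.acos(T / 2.0))
```

Since `loop_gain` returns $-\ell(j\omega I - A)^{-1}b$, the quantity tested in `outside_circle` is the left-hand side of the single input circle inequality. A grid such as `[10 ** (k / 10) for k in range(-30, 31)]` covers six decades of frequency.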
V. Conclusions


It has been recognized that relatively few optimal control problems lead
to stable linear feedback control laws, particularly for time-varying linear
systems. In this paper an unconventional optimal control problem has been
introduced whose modification leads to a stable control. Since the stabilizing
feedback gains are obtained by integrating a Riccati equation over a finite
time interval, its computation should be easy with modern computers. The
control law (4.5) employing (4.12) is one of the easiest ways to stabilize a
linear time invariant system like the one in [3], since the gain matrix can be
obtained by a single integration of a Riccati-type equation as indicated in
(4.12). The results indicate that a short receding horizon distance T should
be chosen when large nonlinearities exist in the feedback loop. Stabilization
with a prescribed degree of stability can be obtained exactly the same as in [2].


Fig. 1. Nonlinearity in the Feedback Loop (block diagram: the system $\dot{x}(t) = A(t)x(t) + B(t)u(t)$ with a nonlinearity in the feedback path)

Fig. 2. A Nyquist Plot of $-\ell(j\omega I - A)^{-1}b$